Optimizing Directed Acyclic Graphs via Simulated Annealing for Reconstructing Human Segmental Duplications
نویسنده
چکیده
Segmental duplications, relatively long and nearly identical regions, prevalent in the mammalian genome, are successfully modeled by directed acyclic graphs. Reconstructing the evolutionary history of these genomic regions is a non-trivial, but important task, as segmental duplications harbor recent primate-specific and human-specific innovations and also mediate copy number variation within the human population. Using novel models derived by Kahn and Raphael, we formalize this reconstructon task as an optimization problem on the space of directed acyclic graphs. We employ a simulated annealing heuristic and describe an efficient way to use the technique to solve the optimization problem in general. We apply the heuristic to both maximum parsimony and maximum likelihood evolutionary models. We use these models to analyze segmental duplications in the human genome and reveal subtle relationships between these blocks.
منابع مشابه
Parsimony and likelihood reconstruction of human segmental duplications
MOTIVATION Segmental duplications > 1 kb in length with >or= 90% sequence identity between copies comprise nearly 5% of the human genome. They are frequently found in large, contiguous regions known as duplication blocks that can contain mosaic patterns of thousands of segmental duplications. Reconstructing the evolutionary history of these complex genomic regions is a non-trivial, but importan...
متن کاملDAGchainer: a tool for mining segmental genome duplications and synteny
SUMMARY Given the positions of protein-coding genes along genomic sequence and probability values for protein alignments between genes, DAGchainer identifies chains of gene pairs sharing conserved order between genomic regions, by identifying paths through a directed acyclic graph (DAG). These chains of collinear gene pairs can represent segmentally duplicated regions and genes within a single ...
متن کاملAutomated memory-aware application distribution for Multi-processor System-on-Chips
Mapping of applications on a Multiprocessor System-on-Chip (MP-SoC) is a crucial step to optimize performance, energy and memory constraints at the same time. The problem is formulated as finding solutions to a cost function of the algorithm performing mapping and scheduling under strict constraints. Our solution is based on simultaneous optimization of execution time and memory consumption whe...
متن کاملUsing Simulated Annealing for Hard Real-Time Design-to-Time Scheduling
We present a two-parts scheduling algorithm based on simulated annealing and dynamical dispatching for the design-to-time scheduling of hard real-time applications. A decision-DAG (directed acyclic graph) is calculated and optimized offline using simulated annealing and then interpreted during runtime. The underlying system model consists of non-preemptable, periodic tasks with release times an...
متن کاملLearning Inclusion-Optimal Chordal Graphs
Chordal graphs can be used to encode dependency models that are representable by both directed acyclic and undirected graphs. This paper discusses a very simple and efficient algorithm to learn the chordal structure of a probabilistic model from data. The algorithm is a greedy hillclimbing search algorithm that uses the inclusion boundary neighborhood over chordal graphs. In the limit of a larg...
متن کامل